MDL - Based Selection of theNumber of Components in Mixture

نویسندگان

  • Hiroshi Tenmoto
  • Mineichi Kudo
  • Masaru Shimbo
چکیده

A new method is proposed for selection of the optimal number of components of a mixture model for pattern classiication. We approximate a class-conditional density by a mixture of Gaussian components. We estimate the parameters of the mixture components by the EM (Expectation Maximization) algorithm and select the optimal number of components on the basis of the MDL (Minimum Description Length) principle. We evaluate the goodness of an estimated model in a trade-oo between the number of the misclassiied training samples and the complexity of the model.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Model Selection for Mixture Models Using Perfect Sample

We have considered a perfect sample method for model selection of finite mixture models with either known (fixed) or unknown number of components which can be applied in the most general setting with assumptions on the relation between the rival models and the true distribution. It is, both, one or neither to be well-specified or mis-specified, they may be nested or non-nested. We consider mixt...

متن کامل

Negative Selection Based Data Classification with Flexible Boundaries

One of the most important artificial immune algorithms is negative selection algorithm, which is an anomaly detection and pattern recognition technique; however, recent research has shown the successful application of this algorithm in data classification. Most of the negative selection methods consider deterministic boundaries to distinguish between self and non-self-spaces. In this paper, two...

متن کامل

On the Determination of Optimal Model Order for GMM-Based Text-Independent Speaker Identification

Gaussian mixture models (GMMs) are recently employed to provide a robust technique for speaker identification. The determination of the appropriate number of Gaussian components in a model for adequate speaker representation is a crucial but difficult problem. This number is in fact speaker dependent. Therefore, assuming a fixed number of Gaussian components for all speakers is not justified. I...

متن کامل

2 Denoising via Thresholding and Model Selection

In the context of wavelet denoising and compression, we study minimum description length (MDL) criteria for model selection criteria as exible forms of thresholding. Mixture MDL methods based on a single Laplacian, a two-piece Laplacian, and a generalized Gaussian prior are shown to be adaptive thresholding rules. While achieving mean squared error performance comparable with other popular thre...

متن کامل

Finite mixture spectrogram modeling for multipitch tracking using a factorial hidden Markov model

In this paper, we present a simple and efficient feature modeling approach for tracking the pitch of two speakers speaking simultaneously. We model the spectrogram features using Gaussian Mixture Models (GMMs) in combination with the Minimum Description Length (MDL) model selection criterion. This enables to automatically determine the number of Gaussian components depending on the available da...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998